i2c-net: Using Instance-Level Neural Networks for Monocular Category-Level 6D Pose Estimation
نویسندگان
چکیده
Object detection and pose estimation are strict requirements for many robotic grasping manipulation applications to endow robots with the ability grasp objects different properties in cluttered scenes various lighting conditions. This work proposes framework i2c-net extract 6D of multiple belonging categories, starting from an instance-level network relying only on RGB images. The is trained a custom-made synthetic photo-realistic dataset, generated some base CAD models, opportunely deformed, enriched real textures domain randomization purposes. At inference time, employed combination 3D mesh reconstruction module, achieving category-level capabilities. Depth information used post-processing as correction. Tests conducted YCB-V NOCS-REAL datasets outline high accuracy proposed approach.
منابع مشابه
Bridging Category-level and Instance-level Semantic Image Segmentation
We propose an approach to instance-level image segmentation that is built on top of category-level segmentation. Specifically, for each pixel in a semantic category mask, its corresponding instance bounding box is predicted using a deep fully convolutional regression network. Thus it follows a different pipeline to the popular detect-then-segment approaches that first predict instances’ boundin...
متن کاملInstance Level Classification Confidence Estimation
Often the confidence of a classification prediction can be as important as the prediction itself although current classification confidence measures are not necessarily consistent between different data sets. Thus in this paper, we present an algorithm to predict instance level classification confidence that is more consistent between data sets and is intuitive to interpret. The results with fi...
متن کاملHead Pose Estimation Using Convolutional Neural Networks
Detection and estimation of head pose is fundamental problem in many applications such as automatic face recognition, intelligent surveillance, and perceptual human-computer interface and in an application like driving, the pose of the driver is used to estimate his gaze and alertness, where faces in the images are non-frontal with various poses. In this work head pose of the person is used to ...
متن کاملHead pose Estimation Using Convolutional Neural Networks
Head pose estimation is a fundamental problem in computer vision. Several methods has been proposed to solve this problem. Most existing methods use traditional computer vision methods and existing method of using neural networks works on depth bitmaps. In this project, we explore using convolutional neural networks (CNNs) that take RGB image as input to estimate the head pose. We use regressio...
متن کاملMonocular Head Pose Estimation
This work addresses the problem of human head pose estimation from single view images. 3D rigid head pose is estimated combining Active Appearance Models (AAM) with Pose from Orthography and Scaling with ITerations (POSIT). AAM shape landmarks are tracked over time and used in POSIT for pose estimation. A statistical anthropometric 3D model is used as reference. Several experiences were perform...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE robotics and automation letters
سال: 2023
ISSN: ['2377-3766']
DOI: https://doi.org/10.1109/lra.2023.3240362